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Central Trajectories 

Marc van Kreveld Maarten Loftier* Frank Staals* 


Abstract 

An important task in trajectory analysis is clustering. The results of a clustering are often summarized by a single 
representative trajectory and an associated size of each cluster. We study the problem of computing a suitable 
representative of a set of similar trajectories. To this end we define a central trajectory C, which consists of 
pieces of the input trajectories, switches from one entity to another only if they are within a small distance of each 
other, and such that at any time t, the point C(t ) is as central as possible. We measure centrality in terms of the 
radius of the smallest disk centered at C(t) enclosing all entities at time t, and discuss how the techniques can be 
adapted to other measures of centrality. We first study the problem in K 1 , where we show that an optimal central 
trajectory C representing n trajectories, each consisting of r edges, has complexity 0(rn 2 ) and can be computed 
in 0(rn 2 log n) time. We then consider trajectories in with d > 2, and show that the complexity of C is at most 
0(r?i 5 / 2 ) and can be computed in 0(rn 3 ) time. 
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Fig. 1: (a) Every trajectory has a peculiarity that is not representative for the set. (b) Taking, for example, the 
pointwise average of a set of trajectories may result in one that ignores context. 


1 Introduction 

A trajectory is a sequence of time-stamped locations in the plane, or more generally in W l . Trajectory 
data is obtained by tracking the movements of e.g. animals [6, 10, 16], hurricanes [24], traffic [21], or 
other moving entities [12] over time. Large amounts of such data have recently been collected in a variety 
of research fields. As a result, there is a great demand for tools and techniques to analyze trajectory data. 

One important task in trajectory analysis is clustering-, subdividing a large collection of trajectories 
into groups of “si mi lar” ones. This problem has been studied extensively, and many different techniques 
are available [7, 14, 15, 20, 25]. Once a suitable clustering has been determined, the result needs to be 
stored or prepared for further processing. Storing the whole collection of trajectories in each cluster 
is often not feasible, because follow-up analysis tasks may be computation-intensive. Instead, we 
wish to represent each cluster by a signature: the number of trajectories in the cluster, together with a 
representative trajectory which should capture the defining features of all trajectories in the cluster. 

Representative trajectories are also useful for visualization puiposes. Displaying large amounts of 
trajectories often leads to visual clutter. Instead, if we show only a number of representative trajectories, 
this reduces the visual clutter, and allows for more effective data exploration. The original trajectories 
can still be shown if desired, using the details-on-demand principle in information visualization [23]. 

Representative trajectories. When choosing a representative trajectory for a group of similar trajecto¬ 
ries, the first obvious choice would be to pick one of the trajectories in the group. However, one can argue 
that no single element in a group may be a good representative, e.g. because each individual trajectory 
has some prominent feature that is not shared by the rest (see Fig. 1(a)), or no trajectory is sufficiently 
in the middle all the time. On the other hand, it is desirable to output a trajectory that does consist of 
pieces of input trajectories, because otherwise the representative trajectory may display behaviour that is 
not present in the input, e.g. because of contextual information that is not available to the algorithm (see 
Fig. 1(b)). 

To determine what a good representative trajectory of a group of similar trajectories is, we identify 
two main categories: time-dependent and time-independent representatives. Trajectories are typically 
collected as a discrete sequence of time-stamped locations. By linearly interpolating the locations we 
obtain a continuous piecewise-linear curve as the image of the function. Depending on the application, 
we may be interested in the curve with attached time stamps (say, when studying a flock of animals that 
moved together) or in just the curve (say, when considering hikers that took the same route, but possibly 
at different times and speeds). 

When time is not important, one can select a representative based directly on the geometry or topology 
of the set of curves [8, 18]. When time is important, we would like to have the property that at each 
time t our representative point c(t) is a good representative of the set of points Pit). To this end, we 
may choose any static representative point of a point set, for which many examples are available: the 
Fermat-Weber point (which minimizes the sum of distances to the points in P), the center of mass (which 
minimizes the sum of squared distances), or the center of the smallest enclosing circle (which minimizes 
the distance to the farthest point in P). 
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Fig. 2: (a) Two views of five moving entities and their trajectories, (b) On the top the pairwise distances 
between the entities as a function over time. On the bottom the functions D a , and in yellow the area 
representing V(C). 


Central trajectories. In this work, we focus on time-dependent measures based on static concepts of 
centrality. We choose the distance to the farthest point, but discuss in Section 4 how our results can be 
adapted to other measures. 

Ideally, we would output a trajectory C such that at any time t, C(t ) is the point (entity) that is closest 
to its farthest entity. Unfortunately, when the entities move in R. d for d > 1, this may cause discontinuities. 
Such discontinuities are unavoidable: if we insist that the output trajectory consists of pieces of input 
trajectories and is continuous, then in general, there will be no opportunities to switch from one trajectory 
to another, and we are effectively choosing one of the input trajectories again. At the same time, we do 
not want to output a trajectory with arbitrarily large discontinuities. An acceptable compromise is to 
allow discontinuities, or jumps, but only over small distances, controlled by a parameter e. We note that 
this problem of discontinuities shows up for time-independent representatives for entities moving in W 1 , 
with d > 3, as well, because the traversed curves generally do not intersect. 

Related work. Buchin et al. [8] consider the problem of computing a median trajectory for a set of 
trajectories without time information. Their method produces a trajectory that consists of pieces of 
the input. Agarwal et al. [1] consider trajectories with time information and compute a representative 
trajectory that follows the median (in M 1 ) or a point of high depth (in M 2 ) of the input entities. The resulting 
trajectory does not necessarily stay close to the input trajectories. They give exact and approximate 
algorithms. Durocher and Kirkpatrick [13] observe that a trajectory minimizing the sum of distances 
to the other entities is unstable, in the sense that arbitrarily small movement of the entities may cause 
an arbitrarily large movement in the location of the representative entity. They proceed to consider 
alternative measures of centrality, and define the projection median, which they prove is more stable. 
Basu et al. [4] extend this concept to higher dimensions. 

Problem description. We are given a set X of n entities, each moving along a piecewise linear 
trajectory in W l consisting of r edges. We assume that all trajectories have their vertices at the same times, 
i.e. times to ,... t T . Fig. 2(a) shows an example. For an entity a, let o(t) denote the position of a at time t. 
With slight abuse of notation we will write a for both entity a and its trajectory. At a given time t, we 
denote the distance from a to the entity farthest away from cr by D a (t) = D{a, t) = max^g^ \\a(t)il>(t) |, 
where \\pq\\ denotes the Euclidean distance between points p and q in W 1 . Fig. 2(b) illustrates the pairwise 
distances and resulting D functions for five example trajectories. For ease of exposition, we assume that 
the trajectories are in general position: that is, no three trajectories intersect in the same point, and no two 
pairs of entities are at distance e from each other at the same time. 
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A trajectoid is a function that maps time to the set of entities X, with the restriction that at disconti¬ 
nuities the distance between the entities involved is at most e. Intuitively, a trajectoid corresponds to a 
concatenation of pieces of the input trajectories in such a way that two consecutive pieces match up in 
time, and the end point of the former piece is within distance e from the start point of the latter piece. In 
Fig. 2(b), a trajectoid may switch between a pair of entities when their pairwise distance function lies in 
the bottom strip of height e. More formally, for a trajectoid T we have that 

• at any time t, T(t) = a for some a G X, and 

• at every time t where T has a discontinuity, that is, T jumps from entity cr to entity ib, we have that 

IkWWII < £• 

Note that this definition still allows for a series of jumps within an arbitrarily short time interval 
[t, t + 5], essentially simulating a jump over distances larger than e. To make the formulation cleaner, we 
slightly weaken the second condition, and allow a trajectoid to have discontinuities with a distance larger 
than e, provided that such a large jump can be realized by a sequence of small jumps, each of distance at 
most e. When it is clear from the context, we will write T(i) instead of T(t)(i) to mean the location of 
entity T(f) at time t. We now wish to compute a trajectoid C that minimizes the function 

P(T) = [ T D(r,t)dt. 

■'t 0 

So, at any time t, all entities lie in a disk of radius D(C,t) centered at C{t). 

Outline and results. We first study the situation where entities move in M 1 . In Section 2 we show 
that the worst case complexity of a central trajectory in M 1 is @(m 2 ), and that we can compute one 
in 0(rn 2 logn) time. We then extend our approach to entities moving in W 1 , for any constant d, in 
Section 3. For this case, we prove that the maximal complexity of a central trajectory C is 0(rn 5//2 ). 
Computing C takes 0(rn 3 ) time and requires 0(rn 2 log n) working space. We briefly discuss various 
extensions to our approach in Section 4. Omitted proofs can be found in Appendix ??. 

Even though we do not expect this to happen in practice, the worst case complexity of our central 
trajectories can be significantly higher than the input size. If this occurs, we can use traditional line 
simplification algorithms like Imai and Iri [19] to simplify the resulting central trajectory. This gives us a 
representative that still is always close —for instance within distance 2e — to one of the input trajectories. 
Alternatively, we can use dynamic-programming combined with our methods to enforce the output 
trajectory to have at most k vertices, for any k, and always be on the input trajectories. Computing such a 
central trajectory is more expensive than our current algorithms, however. Furthermore, enforcing a low 
output complexity may not be necessary. For example, in applications like visualization, the number of 
trajectories shown often has a larger impact visual clutter than the length or complexity of the individual 
trajectories. It may be easier to follow a single trajectory that has many vertices than to follow many 
trajectories that have fewer vertices each. 

2 Entities moving in M 1 

Let X be the set of entities moving in M 1 . The trajectories of these entities can be seen as polylines in M 2 : 
we associate time with the horizontal axis, and M 1 with the vertical axis (see Fig. 3(a)). We observe that 
the distance between two points p and q in M 1 is simply their absolute difference, that is, \\pq\\ = \p — q\- 
Let I be the ideal trajectory, that is, the trajectory that minimizes V but is not restricted to lie on the 
input trajectories. It follows that at any time t, l(t) is simply the average of the highest entity 11 (t) and 
the lowest entity L(t). We further subdivide each time interval J ? ; = [ti,ti+i] into elementary intervals, 
such that I is a single line segment inside each elementary interval. 

Lemma 1. The total number of elementary intervals is r(n + 2). 

Proof. The ideal trajectory I changes direction when Zl(t) or Lit) changes. During a single interval 
[tj, tj + i] all entities move along lines, so U and L are the upper and lower envelope of a set of n lines. So 
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Fig. 3: (a) A set of trajectories and the ideal trajectory I. The breakpoints in the ideal trajectory partition time 
into 0(nr) intervals, (b) The trajectories after transforming / to a horizontal line. 


by standard point-line duality, U and L correspond to the upper and lower hull of n points. The summed 
complexity of the upper and lower hull is at most n + 2. □ 

We assume without loss of generality that within each elementary interval I coincides with the x-axis. 
To simplify the description of the proofs and algorithms, we also assume that the entities never move 
parallel to the ideal trajectory, that is, there are no horizontal edges. 

Lemma 2. C is a central trajectory in M 1 if and only if it minimizes the function 

v'(fi)= f T m 

Jt 0 

Proof A central trajectory C is a trajectoid that minimizes the function 


V(T) 


I D(T,t)dt = / max \\‘T(t)'ip(t)\\ dt = 
t 0 Jto 

t T 

max{|'T(i) — U{t)\, |T(t) — L(t)\} dt. 



max |T(f) — fit) I dt 
ipex 


Jt 0 

Since (u(t) + L(t))/2 = 0, we have that \T(t) — U{f) \ > |T(f) — L{t)\ if and only if T(t) < 0. So, 
we split the integral, depending on T(t), giving us 


Vfr) = / -r(t) - L{t) dt + / u(t ) - T(t) dt 

J to<t<t T AT (t)>0 Jto<t<t T AT(t)<0 


lt 0 <t<t T AT(t)>0 


lt o <t<t T AT(t)<0 


T(t) dt — L(t) dt + 

J t Q <t<t T AT(t)>0 

Ti(t)dt— / T(t)dt. 

Jto<t<UAT(t)<0 


We now use that — f T ^ <0 T(t) = f T ^ <0 |CT(t) |, and that —f L{t) = f U(f) (since ( Zl(t ) + 
A(t))/2 = 0). After rearranging the terms we then obtain 


£>(T) = / T(t) dt + / |'T(t)|dt + 

Jt o <t<t T AT(t)>0 Jto<t<t T AT(t)<0 

/ U(t) dt + / U(t) dt 

J to<t<trAT(t)>0 J t 0 <t<t T AT(t)<0 

= / |T(t)|dt + / U(t)dt. 

Jto<t<t T Jto<t<t T 
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Fig. 4: Lower bound construction that shows that C (red) may have quadratic complexity. The ideal trajectory 
I is shown in green. 


The last term is independent of T, so we have D(‘T) = V'i'T) + c, for some c £ R. The lemma 
follows. □ 

By Lemma 2 a central trajectory C is a trajectoid that minimizes the area 'D'i'T) between T and the 
ideal trajectory I. Hence, we can focus on finding a trajectoid that minimizes V. 

2.1 Complexity 

Lemma 3. For a set ofn trajectories in R 1 , each with vertices at times to, ..,t T , a central trajectory C 
may have worst case complexity H(rn 2 ). 

Proof. We describe a construction for the entities that shows that within a single time interval J = [U,ti + 1 ] 
the complexity of C may be Q(n 2 ). Repeating this construction r times gives us Q(rn 2 ) as desired. 

Within J the entities move linearly. So we construct an arrangement A of lines that describes the 
motion of all entities. We place m = n/3 lines such that the upper envelope of A has linear complexity. 
We do the same for the lower envelope. We position these lines such that the ideal trajectory I —which is 
the average of the upper and lower envelope— makes a vertical “zigzagging” pattern (see Fig. 4). The 
remaining set H of m lines are horizontal. Two consecutive lines are placed at (vertical) distance at most 
e. We place all lines such that they all intersect 7. It follows that C jumps 0(n 2 ) times between the lines 
in H. The le mm a follows. □ 

Two entities a and A are e-connected at time t if there is a sequence rr = op, ... a/. = f of entities 
such that for all i, ai and a , + 1 are within distance e of each other at time t. A subset X' C A of entities 
is e-connected at time t if all entities in X' arc pairwise e-connected at time t. The set X' is e-connected 
during an interval /, if they are e-connected at any time t 6 I. We now observe: 

Observation 4. C can jump from entity a to f> at time t if and only if a and Tj are e-connected at time t. 

At any time t, we can partition X into maximal sets of e-connected entities. The central trajectory 
C must be in one of such maximal sets X'\ it uses the trajectory of an entity o G X' (at time t), if and 
only if o is the entity from X' closest to I. More formally, let f a (t) = |rr(7) |, and let L(fF) = min j G jr f 
denote the lower envelope of a set of functions T. 

Observation 5. Let X' 3 o be a set of entities that is e-connected during interval J, and assume that 
C E X' during J. For any time t £ J, we have that C{t) = cr(t) if and only if f a is on the lower envelope 
of the set T l = {/p | f> G X'} at time t, that is, f a {t) = L(F)(t). 

Let X\,... X rn , denote a collection of maximal sets of entities that are e-connected during time 
intervals J\,.., J m , respectively. Let T r = { f„ \ a G Xf\, and let L, be the lower envelope L[fFf) of Ti 
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restricted to interval ,J, . A lower envelope L r has a break point at time t if fait) = fyft), for a, ip E X t . 
There are two types of break points: (i) apt) = ippt), or (ii) a(t) = —ip(t). At events of type (i) the 
modified trajectories of a and A intersect. At events of the type (ii), a and A are equally far from 7, but on 
different sides of 7. Let B = {(t, a, -0) \ Lift) = f a (t) = fypt) A i E {1,m}} denote the collection 
of break points from all lower envelopes L\.... L rn . 

Lemma 6. Consider a triplet (t. a, ip) E B. There is at most one lower envelope Li such that t is a break 
point in L h . 

Proof. Assume by contradiction that t is a break point in both L, and Lj. At any time t, an entity can be 
in at most one maximal set X(. So if Xi and Xj share either entity a or ip, then the intervals and Jj 
are disjoint. It follows t cannot lie in both intervals, and thus cannot be a break point in both Li and Lj. 
Contradiction. □ 

Lemma 7. Let A be an arrangement of n lines, describing the movement of n entities during an 
elementary interx’al J. If there is a break point pt, a , ip) E B, with t E J, of type (ii), then a(t) and ip(t) 
lie on the boundary dZ of the zone Z of I in A. 

Proof. Let Xj be the maximal e-connected set containing a and ip, and assume without loss of generality 
that f a {t) = cr(t) = —ip(t) = fyj(t). Now, assume by contradiction that er is not on dZ at time t (the 
case that ip(t) is not on dZ is symmetric). This means that there is an entity p with 0 < p(t) < apt). If 
p E Xj, this contradicts that f a (t) was on the lower envelope of X ;l at time t. So p is not e-connected to 
a at time t. Hence, their distance is at least e. We then have a(t) > p(t) + e > e. it now follows that 
a and ip cannot be e-connected at time f : the distance between a and ip is bigger than e so they are not 
directly connected, and f a and /,/, are on Lj, so there are also no other entities in Xj through which they 
can be e-connected. Contradiction. □ 

Lemma 8. Let A be an arrangement of n lines, describing the movement of n entities during an 
elementary interval J. The total number of break points (t , a, 

6.5 n. 

Proof. By Lemma 6 all break points can be charged to exactly 
one set Xj. From Lemma 7 it follows that break points of 
type (ii) involve only entities whose lines in A participate in 
the zone of I. 

Let E be the set of edges of dZ. We have that \E\ < 

5.5 n [5, 22]. We now split every edge that intersects I, at 
the intersection point. Since every line intersects I at most 
once, this means the number of edges in E increases to 6.5n. 

For every pair of edges (e, g), that lie on opposite sides of I, 
there is at most one time t where a lower envelope L = Lj, 
for some j, has a break point of type (ii). 

Consider a break point of type (ii), that is, a time t such that L switches (jumps) from an entity a to 
an entity ip, with a and ip on opposite sides of 7. Let e E E and g E E be the edges containing apt) and 
ippt), respectively. If the arriving edge g has not been charged before, we charge the jump to g. Otherwise, 
we charge it to e. We continue to show that every edge in E is charged at most once. Since E has at most 
6.5n edges, the number of break points of type (ii) is also at most 6.5n. 

We now show that either e or g has not been charged before. Assume, by contradiction, that both e 
and g have been charged before time t, at times t e and t g , respectively. Consider the case that tg<t e 
(see Fig. 5). At time t e , the lower envelope L jumps from an edge h onto e or vice versa. Since there 
is a jump involving edge g at time t g and one at time t it follows that at time t e , g is the closest edge in 
E opposite to e. Hence, h = g. This means we jump twice between e and g. Contradiction. The case 
t e < t g is symmetrical and the case t e = t g cannot occur. It follows that e or g was not charged before 
time t, and thus all edges in E are charged at most once. □ 


0) E B, with t E ■/, of type (ii) is at most 



involving edges e and g. 
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Lemma 9. The total complexity of all lower envelopes 4,.., L rn on [t,, t l+ \' is 0{n 2 ). 

Proof. The break points in the lower envelopes are either of type (i) or of type (ii). We now show that 
there are at most 0(n 2 ) break points of either type. 

The break points of type (i) correspond to intersections between the trajectories of two entities. Within 
interval [tj,fj_|_i] the entities move along lines, hence there are at most 0(n 2 ) such intersections. By 
Lemma 6 all break points can be charged to exactly one set Tj. It follows that the total number of break 
points of type (i) is 0(n 2 ). 

To show that the number of events of the second type is at most 0(n 2 ) as well we divide \t r , f,;+i ] 
in 0(n) elementary intervals such that I coincides with the x-axis. By Lemma 8 each such elementary 
interval contains at most 0{n ) break points of type (ii). □ 

Theorem 10. Given a set ofn trajectories in M 1 , each with vertices at times to, t T , a central trajectory 
C has worst case complexity 0{rn 2 ). 

Proof A central trajectory C is a piecewise function. From Observations 4 and 5 it now follows that C 
has a break point at time t only if (a) two subsets of entities become e-connected or e-disconnected, or 
(b) the lower envelope of a set of e-connected entities has a break point at time t. Within a single time 
interval Jj = [tj, t, + \] there are at most 0(rr) times when two entities are at distance exactly e. Hence, 
the number of events of type (a) during interval Jj is also 0(n 2 ). By Lemma 9 the total complexity of all 
lower envelopes of e-connected sets during Jj is also 0{n 2 ). Hence, the number of break points of type 
(b) within interval Jj is also 0{n 2 ). The theorem follows. □ 

2.2 Algorithm 

We now present an algorithm to compute a trajectoid 
C minimizing V. By Lemma 2 such a trajectoid is 
a central trajectory. The basic idea is to construct a 
weighted (directed acyclic) graph that represents a set 
of trajectoids containing an optimal trajectoid. We can 
then find C by computing a minimum weight path in 
this graph. 

The graph that we use is a weighted version of 
the Reeb graph that Buchin et al. [9] use to model the 
trajectory grouping structure. We review their definition 
here. The Reeb graph JZ is a directed acyclic graph. 

Each edge e = (u, v ) of 7 Z corresponds to a maximal 
subset of entities C e C X that is e-connected during 
the time interval [t u , t v ]. The vertices represent times 
at which the sets of e-connected entities change, that is, the times at which two entities a and ip are at 
distance e from each other and the set containing a merges with or splits from the set containing ip. See 
Fig. 6 for an illustration. 

By Observation 4 a central trajectory C can jump from a to ip if and only if a and ip are e-connected, 
that is, if a and ip are in the same component C e of edge e. From Observation 5 it follows that on 
each edge e, C uses only the trajectories of entities a for which f a occurs on the lower envelope of the 
functions T e = {f a | a E C e }. Hence, we can then express the cost for C using edge e by 

uj e = [ L(fF e ){t)dt. 

J t u 

It now follows that C follows a path in the Reeb graph 'IZ. that is, the set of trajectoids represented by 
7 Z contains a trajectoid minimizing V. So we can compute a central trajectory by finding a minimum 
weight path in TZ from a source to a sink. 





Fig. 6: The Reeb graph for the moving entities 
from Fig. 3(a). The dashed lines indi¬ 
cate that two entities are at distance e. 
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Analysis. First we compute the Reeb graph as defined by Buchin et al. [9]. This takes 0(rn 2 logn) 
time. Second we compute the weight cu e for each edge e. The Reeb graph R. is a DAG, so once 
we have the edge weights, we can use dynamic programming to compute a minimum weight path in 
0(\1Z\) = 0(rn 2 ) time. So all that remains is to compute the edge weights uj e . For this, we need the 
lower envelope L e of each set T e on the interval J e . To compute the lower envelopes, we need the ideal 
trajectory 7, which we can compute 7 in 0(rn log n) time by computing the lower and upper envelope of 
the trajectories in each time interval [t®, i»+i]. 

Lemma 9 implies that the total complexity of all lower envelopes is 0(rn 2 ). To compute them we 
have two options. We can simply compute the lower envelope from scratch for every edge of 1Z. This 
takes 0(rn 2 • n log n) = 0(rn 3 log n) time. Instead, for each time interval Ji = [ti, L+i], we compute 
the arrangement A representing the modified trajectories on the interval J,, and use it to trace L e in A for 
every edge e of 7Z. 

An arrangement of m line segments can be built in 0(m log m + A) time, where A is the output 
complexity [2]. In total A consists of 0(n 2 ) line segments: n + 2 per entity. Since each pair of trajectories 
intersects at most once during p, we have that A = 0{n 2 ). Thus, we can build A in 0(n 2 logn) time. 
The arrangement A represents all break points of type (i), of all functions f a . We now compute all pairs 
of points in A corresponding to break points of type (ii). We do this in 0(ri 2 ) time by traversing the zone 
of 7 in A. 

We now trace the lower envelopes through A: for each edge e = (it, v ) in the Reeb graph with J e C p, 
we start at the point a(t u ), a E C e , that is closest to 7, and then follow the edges in A corresponding to 
L e , taking care to jump when we encounter break points of type (ii). Our lower envelopes are all disjoint 
(except at endpoints), so we traverse each edge in A at most once. The same holds for the jumps. We can 
avoid costs for searching for the starting point of each lower envelope by tracing the lower envelopes in 
the right order: when we are done tracing L e , with e = (it, v), we continue with the lower envelope of an 
outgoing edge of vertex v. If v is a split vertex where a and L are at distance e, then the starting point of 
the lower envelope of the other edge is either cr(t v ) or f(t v ), depending on which of the two is farthest 
from 7. It follows that when we have A and the list of break points of type (ii), we can compute all lower 
envelopes in 0(n 2 ) time. We conclude: 

Theorem 11. Given a set ofn trajectories in R 1 , each with vertices at times to, t T , we can compute a 
central trajectory C in 0{rn 2 logn) time using 0(Tn 2 ) space. 

A central trajectory without jumps. When our entities move in R 1 , it is not yet necessary to have 
discontinuities in C, i.e. we can set e = 0. In this case we can give a more precise bound on the complexity 
of C, and we can use a slightly easier algorithm. The details can be found in Appendix A. 

3 Entities moving in R d 

In the previous section, we used the ideal trajectory 7, which minimizes the distance to the farthest entity, 
ignoring the requirement to stay on an input trajectory. The problem was then equivalent to finding a 
trajectoid that minimizes the distance to the ideal trajectory. In M d , with d > 1, however, this approach 
fails, as the following example shows. 

Observation 12. Let P be a set of points in R 2 . The point in P that minimizes the distance to the ideal 
point (i.e., the center of the smallest enclosing disk of P) is not necessarily the same as the point in P 
that minimizes the distance to the farthest point in P. 

Proof. See Fig. 7. Consider three points a, b and c at the corners of an equilateral triangle, and two points 
p and q close to the center m of the circle through a, b and c. Now p is closer to m than q, yet q is closer 
to b than p (and q is as far from a as from b). □ 



Fig. 7: Point p is closest to the ideal point to, however the smallest enclosing disk centered at q is smaller 
than that of p. 

3.1 Complexity 

It follows from Lemma 3 that the complexity of a central trajectory for entities moving in W 1 is at least 
fl(rn 2 ). In this section, we prove that the complexity of C within a single time interval \t , l , t l+ \ ] is at 
most 0(n 5 / 2 ). Thus, the complexity over all r time intervals is 0(rn 5//2 ). 

Let T denote the collection of functions D a , for a £ X. We partition time into intervals J\. ..,./[, 
such that in each interval J[ all functions D a restricted to J[ are simple , that is, they consist of just one 
piece. We now show that each function D a consists of at most r(2n — 1) pieces, and thus the total 
number of intervals is at most 0{rn 2 ). See Fig. 2(b) for an illustration. 

Lemma 13. Each function D a is piecewise hyperbolic and consists of at most r(2n — 1) pieces. 

Proof Consider a time interval J* = [L. t l+ \ ]. For any entity ip and any time t £ , the function 

||cr(f)V ; (f)|| = v at 2 + bt + c, with a. b, c £ M, is hyperbolic in t. Each pair of such functions can 
intersect at most twice. During Jj, D a is the upper envelope of these functions, so it consists of A 2 (n) 
pieces, where X s denotes the maximum complexity of a Davenport-Schinzel sequence of order s [3]. We 
have A 2 (n) = 2n — 1, so the lemma follows. □ 

Lemma 14. The total number of intersections of all functions in T is at most 0(rn 3 ). 

Proof Fix a pair of entities a, ip. By Lemma 13 there are at most r(2 n — 1) time intervals J, such that 
D a restricted to J is simple. The same holds for /),,,. So, there are at most T(4n — 2) intervals in which 
both D a and D,/, are simple (and hyperbolic). In each interval D a and D v; intersect at most twice. □ 

We again observe that C can only jump from one entity to another if they are e-connected. Hence, 
Observation 4 holds entities moving in W 1 as well. As before, this means that at any time t, we can 
partition X into maximal sets of e-connected entities. Let X' 3 a be a maximal subset of e-connected 
entities at time t. This time, a central trajectory C uses the trajectory of entity a at time t, if and only if cr 
is the entity from X' whose function V a is minimal. Hence, if we define f a = D a Observation 5 holds 
again as well. 

Consider all m' = 0(n 2 ) intervals ./(,... Jf , that together form [tj,tj+i]- We subdivide these 
intervals at points where the distance between two entities is exactly e. Let J \,.., J m denote the set of 
resulting intervals. Since there are 0(n 2 ) times at which two entities are at distance exactly e, we still 
have 0(n 2 ) intervals. Note that for all intervals J, and all entities a, f a is simple and totally defined on 

Ji. 

In each interval .f, a central trajectory C uses the trajectories of only one maximal set of e-connected 
entities. Let X[ be this set, let = { f a \ cr £ X'} be the set of corresponding functions, and let L r be 
the lower envelope of T[, restricted to interval J*. We now show that the total complexity of all these 
lower envelopes is 0(n 5 / 2 ). It follows that the maximal complexity of C in J, is at most 0(n 5//2 ) as well. 

Lemma 15. Let J be an interval, let T be a set of hyperbolic functions that are simple and totally defined 
on J, and let k denote the complexity of the lower envelope L of T restricted to J. Then there are Q( k 2 ) 
intersections of functions in T that do not lie on L. 
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Fig. 8: The function / (blue) has at least 4-2 = 
L(i — l)/2j functions from J-j (black). 


Proof. Let L = L\. ... L/. denote the pieces of 
the lower envelope, ordered from left to right. 

Consider any subsequence L' = Li,...,Lj of 
the pieces. The functions in T are all hyper¬ 
bolic, so every pair of functions intersect at most 
twice. Therefore L' consists of at most A 2 (| ) = 

2\T\ — 1 pieces. Hence, i < 2\T\ — 1. The 
same argument gives us that there must be at least 
4 = L(* + 1)/2J distinct functions of T contribut¬ 
ing to l !. 

Consider a piece Lj = [a, b] such that a is 
the first time that a function / contributes to the 
lower envelope. That is, a is the first time such 
that /(f) = L(t). Clearly, there are at least (/,. such 

pieces. Furthermore, there are at least 4-2 distinct functions corresponding to the pieces Li,Lj_ 2 . Let 
T denote the set of those functions. 

All functions in T are continuous and totally defined, so they span time interval J. It follows that all 
functions in Ti must intersect / at some time after the start of interval J, and before time a. Since a was 
the first time that / lies on L, all these intersection points do not lie on L. See Fig. 8. In total we have at 

4 L(fc+i)/2j L(fc+i)/2j —1 

y>_2= [(i - 1)/2J — XI L*/2J = 0(fc 2 ) 

i =2 i =2 i =1 

such intersections. □ 


Lemma 16. Let T\.T m be a collection ofm sets of hyperbolic partial functions, let J \,.., J m be a 
collection of intervals such that: 

• the total number of intersections between functions in T\ . T m is at most 0{n 3 ), 

• for any two intersecting internals Ji and Jj, Ti and Tj are disjoint, and 

• for every set Ti, all functions in Ti are simple and totally defined on Jj. 

Let Li denote the lower envelope of Ti restricted to Ji, The total complexity of the lower envelopes 
Li, An is 0((m + n 2 )y/n). 

Proof Let kj denote the complexity of the lower envelope A- An interval Jj is heavy if k% > yfn and 
light otherwise. Clearly, the total complexity of all light intervals is at most Oirnfn). What remains is 
to bound the complexity of all heavy intervals. 

Relabel the intervals such that J \,.., Jj, are the heavy intervals. By Lemma 15 we have that in each 
interval Jj, there arc at least ckf intersections involving the functions T, for some c E M. 

Since for every pair of intervals Jj and Jj that overlap the sets T % and Tj are disjoint, we can associate 
each intersection with at most one interval. There are at most 0(n 3 ) intersections in total, thus we have 
fin 3 > 1 — Yli =1 c ^i’ f° r some fi £ M. Using that for all heavy intervals A:,; > fn we obtain 

h h h 

fin 3 > ^2 ckf > ^2 cy/nki = c\fn ^2 hi- 

i=l 2—1 Z=1 

It follows that the total complexity of the heavy intervals is Y2l= 1 4 fin 3 /cy/n = 0(n 2 y/n). □ 

By Lemma 14 we have that the number of intersections between functions in T in time interval 
[tj, 4 + 1 ] is 0(n 3 ). Hence, the total number of intersections over all functions in all sets T[ is also 0(n 3 ). 
All functions in each set T[ are simple and totally defined on Jj, and all intervals J\.J rn are pairwise 
disjoint, so we can use Lemma 16. It follows that the total complexity of l! x ,.., l! m is at most 0(n 5//2 ). 
Thus, in a single time interval the worst case complexity of C is also at most 0 (n 5/2 ) . We conclude: 

Theorem 17. Given a set ofn trajectories in K'( each with vertices at times to,t T , a central trajectory 
C has worst case complexity Ofirn 5 / 2 ). 
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Fig. 9: The modified Reeb graph 7 Z for five entities moving during a single interval, and the corresponding 
functions f a for each entity a. 

3.2 Algorithm 

We use the same global approach as before: we represent a set of trajectoids containing an optimal 
solution by a graph, and then compute a minimum weight path in this graph. The graph that we use, is a 
slightly modified Reeb graph. We split an edge e into two edges at time t if there is an entity a £ C e 
such that D a = f a has a break point at time t. All functions f a , with a £ C e , are now simple and totally 
defined on J e . This process adds a total of 0[rn 2 ) degree-two vertices to the Reeb graph. Let 7 Z denote 
the resulting Reeb graph (see Fig. 9). 

To find all the times where we have to insert vertices, we explicitly compute the functions D a . 
This takes 0(rnA2(n) logn) = 0(rn 2 logn) time, where A s denotes the maximum complexity of a 
Davenport-Schinzel sequence of order s [3], since within each time interval [L, f i+1 ] each D a is the upper 
envelope of a set of n functions that intersect each other at most twice. After we sort these break points in 
0(rn 2 log n) time, we can compute the modified Reeb graph 7 Z in 0(rn 2 log n) time [9]. 1 

Next, we compute all weights u e , for each edge e. This means we have to compute the lower envelope 
L e of the functions J- e = {f a \ a £ C e } on the interval J e . All these lower envelopes have a total 
complexity of at most 0(rn 5 / 2 ): 

Lemma 18. The total complexity of the lower envelopes for all edges of the Reeb graph is 0(rn 5//2 ). 

Proof We consider each time interval J r = [L, t t+ \] separately. Let TZ, denote the Reeb graph restricted 
to Ji. We now show that for each 7 Zi, the total complexity of all lower envelopes L e of edges e in TZ, is 
0(n 2 y/n). The lemma then follows. 

By Lemma 14, the total number of intersections of all functions F e , with e in '1Z ,, is 0(n 3 ). Each set 
T e corresponds to an interval J e , on which all functions in T, are simple and totally defined. Furthermore, 
at any time, every entity is in at most one component C e . So, if two intervals J e and J e t overlap, the sets 
of entities C e and C e >, and thus also the sets of functions T e and T e i are disjoint. It follows that we can 
apply Lemma 16. Since IZi has 0(n 2 ) edges the total complexity of all lower envelopes is Oirf fTi). 
□ 

We again have two options to compute all lower envelopes: either we compute all of them from 
scratch in 0{rn 2 ■ A 2 (n) logn) = 0(rn 3 logn) time, or we use a similar approach as before. For each 
time interval, we compute the arrangement A of all functions T, and then trace L e in A for every edge e. 
For n 2 functions that pairwise intersect at most twice, the arrangement can be built in 0(n 2 log n + A) 
expected time, where A is the output complexity [2]. The complexity of A is 0(n 3 ), so we can construct 

1 The algorithm to compute the Reeb graph is presented for entities moving in R 2 in [9], but it can easily be extended to 

entities moving in R d . 
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it in 0(n 3 ) time. As before, every edge is traversed at most once so tracing all lower envelopes L e takes 
0(n 3 ) time. It follows that we can compute all edge weights in 0(rn 3 ) time, using 0(n 3 ) working 
space. 

Computing a minimum weight path takes 0(rn 2 ) time, and uses 0(rn 2 ) space as before. Thus, we 
can compute C in 0(rn 3 ) time and 0(n 3 + rn 2 ) space. 

Reducing the required working space. We can reduce the amount of working space required to 
0(n 2 log n + rn 2 ) as follows. Consider computing the edge weights in the time interval J = [f t , ti + 1 ]. 
Interval J is subdivided into 0(v 2 ) smaller intervals Ji,.., J m as described in Section 3.1. We now 
consider groups of r consecutive intervals. Let J be the union of r consecutive intervals, we compute 
the arrangement A of the functions F, restricted to time interval J. Since every interval J, has at most 

0(n 2 ) intersections A has worst case complexity O(nr ). Thus, at any time we need at most 0(rn ?) 

2 / 

space to store the arrangement. In total this takes 0(^)=i ( n i log n t + A*)) time, where n, is the 
number of functions in the i th group of intervals, and A* is the complexity of the arrangement in group 
i. The total complexity of all arrangements is again 0(n 3 ). Since we cut each function D a into an 
additional 0{n 2 jr) pieces, the total number of functions is 0(n 3 /r + n 2 ). Hence, the total running time 
is 0((n 3 /r) logn + n 3 ). We now choose r = 0(logn) to compute all edge weights in [ti,ti+i] in 0(v 3 ) 
time and 0(n 2 logn) space. We conclude: 

Theorem 19. Given a set of n trajectories in W l , each with vertices at times to,... t T , we can compute a 
central trajectory C in 0(tv 3 ) time using 0{n 2 logn + rn 2 ) space. 

4 Extensions 

We now briefly discuss how our results can be extended in various directions. 

Other measures of centrality. We based our central trajectory on the center of the smallest enclosing 
disk of a set of points. Instead, we could choose other static measures of centrality, such as the Fermat- 
Weber point, which minimizes the sum of distances to the other points, or the center of mass, which 
minimizes the sum of squared distances to the other points. In both cases we can use the same general 
approach as described in Section 3. 

Let D 2 (t) = || cr(i)'0(t) || 2 denote the sum of the squared Euclidean distances from a to all 

other entities at time t. This function D 2 is piecewise quadratic in t, and consists of (only) 0(t) pieces. 
It follows that the total number of intersections between all functions D 2 , a € F, is at most 0(rn 2 ). We 
again split the domain of these functions into elementary intervals. The Reeb graph TZ representing the 
e-connectivity of the entities still has 0(rn 2 ) vertices and edges. Each vertex of C corresponds either 
to an intersection between two functions D 2 and D 2 ., or to a jump, occurring at a vertex of TZ. It now 
follows that C has complexity 0(tv 2 ). 

To compute a central trajectory, we compute a shortest path in the (weighted) Reeb graph TZ. To 
compute the weights we again construct the arrangement of all curves D 2 , and trace the lower envelope 
L e of the curves associated to each edge e £ TZ. This can be done in 0{tv? log n) time in total. 

The sum of Euclidean distances D a (t ) = ll°’(0' i /’(^)ll * s a sum °f square roots, and cannot be 

represented analytically in an efficient manner. Hence, we cannot efficiently compute a central trajectory 
for this measure. 

Similarly, depending on the application, we may prefer a different way of integrating over time. 
Instead of the integral of D, we may, for example, wish to minimize max/ D(-,t ) or j l) 2 (-.t). Again, 
the same general approach still works, but now, after constructing the Reeb graph, we compute the 
weights of each edge differently. 

Minimizing the distance to the Ideal Trajectory 7. We saw that for entities moving in M 1 , minimizing 
the distance from C to the farthest entity is identical to minimizing the distance from C to the ideal 
trajectory I (which itself minimizes the distance to the farthest entity, but is not constrained to lie on an 
input trajectory). We also saw that for entities moving in M d , d > 1. these two problems are not the same. 
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So, a natural question is if we can also minimize the distance to 7 in this case. It turns out that, at least for 
M 2 , we can again use our general approach, albeit with different complexities. 

Demaine et al. [11] show that for entities moving along lines 2 in M 2 the ideal trajectory 7 has 
complexity 0(n 3+s ) for any <5 > 0. It follows that the function D a (t) = | l(t)a(t)\\ is a piecewise 
hyperbolic function with at most 0(rn 3+s ) pieces. The total number of intersections between all 
functions D a , for rr £ A 7 , is then 0(rn 5+<5 ). Similar to Lemma 16, we can then show that all lower 
envelopes in 1Z together have complexity 0(-™ 4+ ' 5 ). We then also obtain an 0(rn 4+<5 ) bound on the 
complexity of a central trajectory C minimizing the distance to 7. 

To compute such a central trajectory C we again construct TZ. To compute the edge weights it 
is now more efficient to recompute the lower envelope L e for each edge e from scratch. This takes 
0(rn 3 • nlogn) = 0(rn 4 logn) time, whereas constructing the entire arrangement may take up to 
0(rn 5+<5 ) time. 

We note that the 0(n 3+s ) bound on the complexity of 7 by Demaine et al. [11] is not known to be 
tight. The best known lower bound is only fl(n 2 ). So, a better upper bound for this problem immediately 
also gives a better bound on the complexity of C. 

Relaxing the input pieces requirement. We require each piece of the central trajectory to be part of 
one of the input trajectories, and allow small jumps between the trajectories. This is necessary, because 
in general no two trajectories may intersect. Another interpretation of this dilemma is to relax the 
requirement that the output trajectory stays on an input trajectory at all times, and just require it to be 
close (within distance e) to an input trajectory at all times. In this case, no discontinuities in the output 
trajectory are necessary. 

We can model this by replacing each point entity by a disk of radius e. The goal then is to compute a 
shortest path within the union of disks at all times. We now observe that if at time t the ideal trajectory 7 
is contained in the same component of e-disks as C, the central trajectory will follow 7. If 7 lies outside of 
the component, C travels on the border of the e-disk (in the component containing C) minimizing 
In terms of the distance functions, this behavior again corresponds to following the lower envelope of a 
set of functions. We can thus identify the following types of break points of C : (i) break points of 7, (ii) 
breakpoints in one of the lower envelopes L\ ,.., L m corresponding to the distance functions of the entities 
in each component, and (iii) break points at which C switches between following 7 and following a lower 
envelope L 3 . There are at most 0(rn 3+s ) break points of type (i) [11], and at most 0(Tn 2 ^/n) of type 
(ii). The break points of type (iii) correspond to intersections between 7 and the manifold that we get by 
tracing the e-disks over the trajectory. The number of such intersections is at most 0(rn 4+5 ). Hence, in 
this case C has complexity 0(rn 4+ ' i ). We can thus get an 0(rn 5+ ' 5 log n) algorithm by computing the 
lower envelopes from scratch. 
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A A continuous central trajectory for entities moving in M 1 
A.1 Complexity 


We analyze the complexity of central trajectory C\ a trajectoid that minimizes V, in the case e = 0. We now show 
that on each elementary interval C has complexity 24 n. It follows that the complexity of C during a time interval 
[L, ti + 1 ] is 24n 2 + 48n, and that the total complexity is 0 (to 2 ). 

We are given n lines representing the movement of all the entities during an elementary interval. We split 
each line into two half lines at the point where it intersects I (the x-axis). This gives us an arrangement A of 2 n 
half-lines. A half-line £ is positive if it lies above the x-axis, and negative otherwise. If the slope of £ is positive, £ 
is increasing. Otherwise it is decreasing. For a given point p on £, we denote the “sub”half-line of £ starting at p by 
£ p- \ Let C be a trajectoid that minimizes V. 

Lemma 20. Let £ and m be two positive increasing half-lines, of which £ has the largest slope, and let v be their 
intersection point. At vertex v, C does not continue along £ v “L 

Proof. Assume for contradiction that C starts to travel along l v ~^ at vertex v (see Fig. 10). Let t*, with t* > t v , 
be the first time where C intersects m again after visiting C ,_ \ or oo if no such time exists. Now consider the 
trajectoid T, such that T(t) = m(t) for all times t £ [t v ,t*], and T(f) = C[t) for all other times t. At any time t 
in the interval (t v , t*) we have 0 < T(t) < C{t). It follows that V(fT) < V'(C). Contradiction. □ 


£ 



Fig. 10 : A central trajectory does not visit the half-line £ v ^. 


Corollary 21. Let £ and m be two positive increasing half-lines, of which £ is the steepest, and let v be their 
intersection point. C does not visit that is, £ v (T C = 0. 

Consider two positive increasing half-lines £ and £' in A, of which £ is the steepest, and let v be their intersection 
point. Corollary 21 guarantees that C never uses the half-line f u_ L Hence, we can remove it from A (thus replacing 
£ by a line segment) without affecting C. By symmetry, it follows that we can clip one half-line from every pair of 
half-lines that are both increasing or decreasing. Let A! be the arrangement that we obtain this way. See Fig. 11 for 
an illustration. 




Fig. 11: (a) The clipped increasing half-lines (purple) on top of the original half-lines (gray), and (b) the 
resulting arrangement A' and its zone (in yellow). 

Consider the set Z of open faces of A! intersected by I, and let E be the set of edges bounding them. We refer 
to Z U E as the zone of I in A!. 

Lemma 22. C is contained in the zone of I in A'. 

Proof. We can basically use the argument as in Lemma 20. Assume by contradiction that C lies outside the zone 
from u to v. The path from u to v along the border of the zone is x-monotone, hence there is a trajectoid T that 
follows this path during [t u , t v ) and for which T(f) = C(t) at any other time. It follows that V(fT) < V{C) giving 
us a contradiction. □ 


15 


Lemma 23. The zone Zp of a line £ in A' has maximum complexity 8 n. 

Proof. We show that the complexity of Zp restricted to the positive half-plane is at most An. A symmetric argument 
holds for the negative half-plane, thus proving the lemma. Since we restrict ourselves to the positive half-plane, the 
half-lines and segments of A! correspond to two forests: a purple forest with p segments 3 , and a brown forest with 
b segments. Furthermore, we have p + b = n. 

Rotate all segments such that £ is horizontal. We now show that the number of left-bounding edges in Zp is 2 n. 
Similarly, the number of right-bounding edges is also 2 n. Consider just the purple forest. Clearly, there are at most 
p left-bounding edges in the zone of £ in the purple forest. We now iteratively add the edges of the brown forest 
in some order that maintains the following invariant: the already-inserted brown segments form a forest in which 
every tree is rooted at an unbounded segment (a half-line). We then show that every new left-bounding edge in the 
zone either replaces an old left-bounding edge or can be charged to a purple vertex or a brown segment. In total we 
gather p + 2b charges, giving us a total of 2p + 2b = 2n left bounding edges. 

Let s be a new brown leaf segment that we add to A', and consider the set J of all intersection points of s 
with edges that form Zp in the arrangement so far. The points in J subdivide s into subsegments si,.., Sk (See 
Fig. 12). All new edges in Zp are subsegments of s. We charge the subsegment s, that intersects £ (if any), and Sk 
to s itself. The remaining subsegments replace either a brown edge or a purple vertex from Zp, or they yield no new 
left bounding edges. Clearly, segments replacing edges on Zp do not increase the complexity of Zp. We charge the 
segments replacing purple vertices to those vertices. We claim that a vertex v gets charged at most once. Indeed, 
each vertex has three incident edges, only two of which may intersect £. The vertex gets charged when a brown 
segment intersects those two edges between v and £. After this, v is no longer part of Zp in that face (though it may 
still be in Zp in its other faces). It follows that the total number of charges is p+ 2b. □ 



Fig. 12: The new segment s (fat) subdivided into subsegments. S 3 and sq are charged to s, S 5 is charged 
to a purple vertex. Segment 54 replaces a purple edge in the zone, and si and S 2 do not give new 
left bounding edges. 


Theorem 24. Given n lines, a trajectoid C that minimizes V 1 has worst case complexity 8 n. 

Proof. C is contained in the zone of A! . So the intersection vertices of C are vertices in the zone of A! . By 
Lemma 23, the zone has at most 8n vertices, so C has at most 8 n vertices as well. □ 

Let A* be the total arrangement of all restricted functions; that is, a concatenation of the arrangements A! 
restricted to the vertical slabs defined by their elementary time intervals. Define the global zone as the zone of J in 
A*. Note that the global zone is more than just the union of the individual zones in the slabs, since cells can be 

3 Note that some of these segments —the segments corresponding to the roots of the trees— are actually half-lines. 
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connected along break points (and are not necesarily convex anymore). Nonetheless, we can still show that the 
complexity of the global zone is linear. 

Lemma 25. The global zone has complexity 24rn 2 + 48m 

Proof. The global zone is a subset of the union of the zones of J and the vertical lines x = ti , for i £ 0,.., k in the 
arrangements A'. By Lemma 23, a line intersecting a single slab has zone complexity 8 n. Each slab is bounded 
by two vertical lines and intersected by J, so applying the lemma three times yields a 24n upper bound on the 
complexity in a single slab. Since there are r(n + 2) elementary intervals, we conclude that the total complexity is 
at most 24rn 2 + 48rn. □ 

As before, it follows that C is in the zone of / in A*. Thus, we conclude: 

Theorem 26. Given a set ofn trajectories in R 1 , each with vertices at times to,t T , a central trajectory C with 
e = 0, has worst case complexity 0(rn 2 ). 

A.2 Algorithm 

We now present an algorithm to compute a trajectoid C minimizing V . It follows from Lemma 2 that such a 
trajectoid is a central trajectory. The basic idea is to construct a weighted graph that represents a set of trajectoids, 
and is known to contain an optimal trajectoid. We then find C by computing a minimum weight path in this graph. 

The graph that we use is simply a weighted version of the global zone of I. We augment each edge e = (u , v ) 
in the global zone with a weight /)*” |e(i) d t. Hence, we obtain a weighted graph Q. Finally, we add one source 
vertex that we connect to the vertices at time to with an edge of weight zero, and one target vertex that we connect 
with all vertices at time t T . This graph represents a set of trajectoids, and contains an optimal trajectoid C. 

We find C by computing a minimum weight path from the source to the target vertex. All vertices except the 
source and target vertex have constant degree. Furthermore, all zones have linear complexity. It follows that Q has 
0(rn 2 ) vertices and edges, and thus, if we have Q , we can compute C in 0(rn 2 log(rn)) time. 

We compute Q by computing the zone(s) of the arrangement A! in each elementary interval. We can find the 
zone of A! in 0((n + k)a(n + k ) logn) = 0(na(n) logn) expected time, where k is the complexity of C and a 
is the inverse Ackermann function, using the algorithm of Har-Peled [17]. Since A' has a special shape, we can 
improve on this slightly as follows. 

We use a sweep line algorithm which sweeps A! with a vertical line from left to right. We describe only 
computing the upper border of the zone (the part that lies above I). Computing the lower border is analogous. So in 
the following we consider only positive half-lines. 

We maintain two balanced binary search trees as status structures. One storing all increasing half-lines, and 
one storing all decreasing half-lines. The binary search trees are ordered on increasing y-coordinate where the 
half-lines intersect the sweep line. We use a priority queue to store all events. We distinguish the following events: 
(i) a half-line starts or stops at I, (ii) an increasing (decreasing) half-line stops (starts) because it intersects an other 
increasing (decreasing) half-line, and (iii) we encounter an intersection vertex between an increasing half-line and a 
decreasing half-line that lies in the zone. In total there are 0(n) events. 

The events of type (ii) involve only neighboring lines in the status structure, and the events of type (iii) involve 
the lowest increasing (decreasing) half-line and the decreasing (increasing) half-lines that are in the zone when they 
are intersected by the sweep line. To maintain the status structures and compute new events we need constantly 
many queries and updates to our status structures and event queue. Hence, each event can be handled in O(logn) 
time. 

The events of type (i) are known initially. The first events of type (ii) and (iii) can be computed in (9(log n) 
time per event (by inserting the half-lines in the status structures). So, initializing our status structures and event 
queue takes O(nlogn) time. During the sweep we handle 0{n) events, each taking O(logn) time. Therefore, we 
can compute the zone of A ' in 0(n log n) time in total. 

Computing a minimum weight path. We can slightly improve the running time by reducing the time required to 
compute a minimum weight path. If, instead of a general graph Q = ( V , E) we have a directed acyclic graph (DAG), 
we can compute am minimum weight path in only 0(|F| + \E\) = 0{rn 2 ) time using dynamic programming. We 
transform Q into a DAG by orienting all edges e = (u, v). with tu ^ ty ? from u to v. 

The running time is now dominated by constructing the graph. We conclude: 

Theorem 27. Given a set ofn trajectories in R 1 , each with vertices at times to,... t T , we can compute a central 
trajectory C for e = 0 in 0(rn 2 logn) time using 0(rn 2 ) space. 
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